首页> 外文OA文献 >Computing on Masked Data: a High Performance Method for Improving Big Data Veracity

【2h】

Computing on Masked Data: a High Performance Method for Improving Big Data Veracity

机译：掩盖数据计算：一种改进大数据的高性能方法数据准确性

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

The growing gap between data and users calls for innovative tools thataddress the challenges faced by big data volume, velocity and variety. Alongwith these standard three V's of big data, an emerging fourth "V" is veracity,which addresses the confidentiality, integrity, and availability of the data.Traditional cryptographic techniques that ensure the veracity of data can haveoverheads that are too large to apply to big data. This work introduces a newtechnique called Computing on Masked Data (CMD), which improves data veracityby allowing computations to be performed directly on masked data and ensuringthat only authorized recipients can unmask the data. Using the sparse linearalgebra of associative arrays, CMD can be performed with significantly lessoverhead than other approaches while still supporting a wide range of linearalgebraic operations on the masked data. Databases with strong support ofsparse operations, such as SciDB or Apache Accumulo, are ideally suited to thistechnique. Examples are shown for the application of CMD to a complex DNAmatching algorithm and to database operations over social media data.

机译：数据和用户之间日益扩大的差距要求使用创新的工具来应对大数据量，速度和多样性所面临的挑战。与大数据的这三个标准V一起，真实性出现了第四个“ V”，它解决了数据的机密性，完整性和可用性。确保数据真实性的传统加密技术可能会有太大的开销，无法应用于大数据数据。这项工作引入了一种新技术，称为“基于屏蔽数据的计算”（CMD），它允许直接在屏蔽数据上执行计算并确保只有授权的接收者才能对数据进行屏蔽，从而提高了数据准确性。使用关联数组的稀疏线性代数，可以以比其他方法少得多的开销来执行CMD，同时仍然支持对掩码数据进行广泛的线性代数运算。强烈支持稀疏操作的数据库（例如SciDB或Apache Accumulo）非常适合此技术。给出了将CMD应用到复杂的DNA匹配算法以及社交媒体数据上的数据库操作的示例。

著录项

作者
Kepner, Jeremy; Gadepally, Vijay; Michaleas, Pete; Schear, Nabil; Varia, Mayank; Yerukhimovich, Arkady; Cunningham, Robert K.;
展开▼
作者单位

展开▼
年度 2014
总页数
原文格式 PDF
正文语种
中图分类

相似文献

外文文献
中文文献
专利

1. Methods for Assessing, Predicting, and Improving Data Veracity: A survey [J] . Fatmah Assiri Advances in Distributed Computing And Artificial Intelligence Journal . 2020,第4期

机译：评估，预测和提高数据准确性的方法：调查
2. A high-performance computing method for data allocation in distributed database systems [J] . Ismail Omar Hababeh, Muthu Ramachandran, Nicholas Bowring Journal of supercomputing . 2007,第1期

机译：分布式数据库系统中数据分配的高性能计算方法
3. Improving the Veracity of Open and Real-Time Urban Data [J] . GAVIN MCARDLE, ROB KITCHIN Built environment . 2016,第3期

机译：改善开放实时城市数据的准确性
4. Computing on masked data: a high performance method for improving big data veracity [C] . Kepner Jeremy, Gadepally Vijay, Michaleas Pete, IEEE Conference on High Performance Extreme Computing . 2014

机译：在屏蔽数据上进行计算：一种提高大数据准确性的高性能方法
5. A Case Study on Determining the Big Data Veracity: A Method to Compute the Relevance of Twitter Data [D] . Paryani, Jyotsna. 2017

机译：确定大数据准确性的案例研究：一种计算Twitter数据相关性的方法
6. Scalable analysis of Big pathology image data cohorts using efficient methods and high-performance computing strategies [O] . Tahsin Kurc, Xin Qi, Daihou Wang, 2015

机译：使用高效方法和高性能计算策略可扩展地分析大病理图像数据
7. Computing on Masked Data to improve the Security of Big Data [O] . Gadepally, Vijay, Hancock, Braden, Kaiser, Benjamin, 2015

机译：计算掩码数据以提高大数据的安全性

Computing on Masked Data: a High Performance Method for Improving Big Data Veracity

摘要

著录项

相似文献

相关主题

期刊订阅